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We prove here the long conjectured fact that the regular simplex is the code 
of minimal error probability for tra?ismission over the infinite-band Gaussian 
channel. The code is actually optimal for a rather wide class of assumed 
channel noises. We also establish the optimality of several other codes for 
the band-limited Gaussian channel. 

I. INTRODUCTION 

Since its introduction by Shannon 1 and Kotel'nikov 2 nearly 20 years 
ago, the geometric representation of signals has played an important 
role in communication theory.* By this scheme, a variety of physically 
different time-continuous communication systems can all be reduced to 
the same geometric model. The problem of finding optimal signals for 
transmission then becomes a geometric one. This paper solves one such 
problem. 

In the model in question, signals to be transmitted are represented as 
points, or vectors from the origin, in a suitable finite dimensional Euclid- 
ean signal space S„ . The energy of any signal in S„ is proportional to the 
length of its representative vector; the bandwidth of the communication 
system is proportional to the dimension n of the signal space. Received 
signals are also represented by vectors in 8„ and the difference Z = 
Y — X between a transmitted signal X and the corresponding received 
signal Y is a vector random variable representative of the noise en- 
countered during transmission. In a model commonly considered, the 
probability density of Z depends only on its magnitude, i.e., 

P(Z1,Z2,--- ,Zn) =f(\Z\), (1) 

* A detailed description of this viewpoint along with some references to the 
intervening literature can be found in Chapters 4 and 5 of the recent book 3 by 
Wozencraft and Jacobs. 

1247 



1248 THE BELL SYSTEM TECHNICAL JOURNAL, OCTOBER 1960 

and/(- ) is an integrable nonnegative monotone decreasing function of 
its argument. We shall consider only this case in all that follows. 

Suppose the transmitter has a list of M signals, Xi , X 2 , • • • ,X M 
from which it selects the successive signals to be transmitted. We sup- 
pose these choices are made independently with equal probabilities and 
that the code, or list of possible sent signals, is known to the receiver. 
The receiver partitions 8„ into M disjoint regions (Ri , (R 2 , • • • , (Rm 
called decision regions. When the received signal lies in (R, , the receiver 
asserts that X< was transmitted. With this scheme, the probability of 
correct decoding is 

Q = i E [ /(I X - X,- 1) dxidxi ■ ■ ■ dx„ , (2) 

where X = (xi , x% , ■ • ■ , Xn) is a generic point in 8„ . 

With M and n given, how large can Q be made by proper choice of the 
code and decision regions? For a given code it is well known (see Ref. 3, 
Section 4.2, for example) that Q is maximized by choosing 

<SU = {X I I X - X,- I < I X - X,- 1, j * i\, (3) 

i = 1,2, • • • , M. That is, the ith. decision region consists of all points of 
8„ closer to X t - than to any other code word. Decision regions determined 
by (3) are known as maximum-likelihood regions. 

The maximization of Q over the code is more complicated. To obtain 
a meaningful problem it is necessary to put some restriction on the 
length of the code vectors, for without this, Q can be made arbitrarily 
close to unity by choosing large enough vectors in distinct directions. 
Several different energy restrictions have been studied in the literature 
(see Ref. 4). Although optimal codes under these restrictions have not 
been found in general for fixed M and n, much detail is known in the 
Gaussian case 

Kx) = °*P(-M>°'\ (4) 



about the asymptotic form of Q for such optimal codes, as n — * oo with 
(1/n) log M — > R. These results are usually described in the channel 
capacity and reliability formulae terms of information theory.' ' 

In this paper we restrict our attention to the case in which all code 
vectors are the same length. For convenience, we take 

|X,-| = 1, t= 1,2, ... ,M. (5) 

Such codes are called "equal energy codes". 6 The code optimization 
problem can then be stated as follows. Find M points Xi , X 2 , • • • , X A/ 
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on the unit sphere* in S„ such that Q as given by (2) and (3) attains 
its maximum value. 

To our knowledge, the first investigation of particular codes from this 
geometric point of view was earned out in 1948 by L. A. MacColl who 
investigated codes corresponding to the vertices of the three regular 
polytopes 8 in n-space. These are the regular simplex for which M = 
n + 1, the hypercube for which M = 2" and the cross-polytope or 
biorthogonal code for which M = 2n. MacColl wrote explicit expressions 
for Q for these codes and evaluated them numerically for the Gaussian 
case (4) for a variety of values of n and <r. Gilbert continued this work 
and made comparisons with a variety of other point configurations. 
Balakrishnan 10 established a new expression for Q in the Gaussian case, 
which permitted him to show that the regular simplex code is locally 
optimal (yields a larger Q than nearby equal energy codes with M = 
n + 1). Later lI he showed that as a — > ■» and as a — > the optimal 
code of n + 1 points approached the regular simplex. Weber 12 used 
Balakrishnan 's form for Q to show that for n = 2 the (globally) optimal 
code of M points, M = 3, 4, • • • , is the regular M-gon. For n = 2, 3, • • • , 
he also showed the biorthogonal code to be a local optimum among equal 
energy codes with M = 2n, and described a family of locally optimal 
codes for M = n + 1, n + 2, • • • , 2w. 

In this paper, we at last lay to rest the longstanding conjecture that 
the regular simplex is optimal for M — n + 1 in the Gaussian case.f 
Specifically, we show that Q as given by (2)- (3) is greater for the regular 
simplex than for any other equal energy code of M = n + 1 points in 
8„ , n = 3, 4, 5, • • • . This result is true for any monotone decreasing/. 
The method of proof is based on a generalization to higher dimensions of 
a theorem of Fejes-Toth 13 concerning expressions related to the form 

(2) in 3 dimensions. % Our methods also establish that the optimal equal 
energy codes with parameters M = 6, n = 3, and M = 12, n = 3 are, 
respectively, the biorthogonal code and the code consisting of the mid- 
points of the faces of the regular dodecahedron. We conclude with some 
comments about the biorthogonal code and about the reliability of the 
infinite-band Gaussian channel. 

II. AN INEQUALITY FOR Q 

For an equal energy code, the maximum-likelihood region (R, given by 

(3) can be determined as follows. Let 3C,-,- denote the hyperplane that 



* We shall hereafter use the caret "• to denote unit vectors. 

t It is incorrectly stated in Ref. 3, pp. 260, 364 that this result has been previ- 
ously shown in the literature. 

% We are indebted to E. N . Gilbert for calling Fejes-Toth 's work to our attention. 
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bisects perpendicularly the line segment joining X, to X,- . This plane 
passes through the origin and divides £„ into two half -spaces. We denote 
by IL.-j the half -space containing X, . It consists of all points of 8„ closer 
to X< than to Xy . The region (R, is the intersection of M-\ such half- 
spaces, 

M 

(R,- = n 01,7. 

It is, therefore, a convex region bounded by a certain number of hyper- 
plane faces that pass through the origin — a kind of flat-sided cone with 
vertex at the origin. We note that the various maximum-likelihood 
regions, (Ri , (R 2 , ' * ' , (Rm , are disjoint and that together with their 
boundaries they exhaust S n . 

Let us now call any convex region of S„ bounded by k ^ n hyperplanes 
through the origin a "flat-sided cone". We shall establish an upper bound 
for Q as given by (2) when the M decision regions (Ri are any set of 
disjoint flat-sided cones (not necessarily maximum-likelihood regions of 
any code) that together with their boundaries exhaust 8„ . For our 
purposes it suffices to consider only the case in which X; lies in the in- 
terior of (Ri , i = 1, 2, • • • , M. 

We denote by S the surface of the unit sphere in S„ with center at the 
origin. We denote by R t the intersection of (Ri with S. The regions Ri 
are "spherical polygons" that reticulate S into a map or net. We shall 
evaluate Q by first integrating over this net on $ and by then performing 
a radial integration. 

Let X be a generic point in 8„ distant r from the origin (see Fig. 1 ) 




Fig. 1 — Reduction to unit sphere. 

and let X be a unit vector in the direction of X, i.e., the terminus of X 
is the radial projection of the generic point onto S. Then 

| X t - - X | 2 = 1 + r 2 - 2r cos y 



and 



so that 
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X,- - X I 2 = 2 - 2 cos 7 



X, - X | 2 = (1 - r) 2 + r | Xi - X 



We can thus write /( | X» — X | ) = g» r ( | X,- — X | ) where for each fixed 
r the function g T ( • ) is nonnegative and is monotone decreasing in its 
argument. The expression (2) in these terms becomes 

Q = f drv n - l U(r) (6) 

Jo 

U(r) =±t,[ gr(\±i-t\)ds (7) 

where ds is the differential surface- or (n — 1) -content of S at the 
point X. Note that U ^ 0. We proceed to find an upper bound for U. 
By (6) this will provide the desired bound for Q. 

Let the terminus of the unit vector Y determine a point P on S (see 
Fig. 2). The set of all points X on S such X- Y ^ cos <p ^ will be called 
"the spherical cap of S of angle <p about P". Now let 50, be a hyperplane 
through the origin but not containing P that intersects this spherical 
cap. That is < n-Y < sin <p where ft is the unit normal to JC directed 
positively toward the side on which P lies. 3C divides the spherical cap 
into two parts. We denote by W the part of the cap not containing P, 
and we denote by w the content of W. 

In what follows, the function 

KM = f g r (\Y - X\)ds (8) 

will be of great importance to us. The notation suppresses the dependence 
of h on <p, the angle of the spherical cap, and points out that with the 




,-CAP 



~~x 



Fig. 2 — Cap cut by hyperplane. 
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geometry as described this integral is a function only of 

= f ds (9) 



W 

'W 



the content of W. We shall suppose <p fixed in all that follows. 

Two special properties of h r (w) are of particular concern. First, as 
shown in Appendix B, this function is increasing and convex. That is, if 
w 2 > wi , then h r ' (w 2 ) > h,' (w>i) where h r ' (w) = dh r /dw. This, of course, 
implies that 

EMrK) ^ hr(^ piU)i) , (10) 

where the p» are nonnegative weight factors summing to unity. Equality 
holds only when the to,- are all equal. 

The second property of h r (w) is somewhat more complicated to state, 
though in three dimensions it is intuitively obvious. Again let 3C be a 
hyperplane through the origin but not through P that cuts off a piece W 
of the spherical cap about P. Let 3Ci , 5C 2 , • • • , 3Cy be hyperplanes that 
each contain the origin and P. We denote by V the portion of W lying 
on the positive side of 0C,- , i = 1,2, • • • , j, and we denote the content of 
V by v. It is established in Appendix C that 



I 



g r (\?-±\)ds^hr(v), (11) 

where as before ¥ is the vector from the origin to P and a is a generic 
point of V. Equality holds only if 3C is the sole hyperplane boundary of V 
(i.e., if none of 3Ci , 3C 2 , • • • , 3C; form a part of the boundary of V). 

With these two properties of h r {w) we can now establish the desired 
inequality for U(r). We first "triangulate" each of the polygonal regions 
Ri into "spherical pyramids" Ri, having boundaries of Ri as bases and 
&,• as a vertex. More accurately described, the regions Ry are found as 
follows. The flat-sided cone (R» is bounded by pieces of ki (say) hyper- 
planes 3Ci (,) , • • • , 3C fri (t) through the origin. We denote by (B,y the portion 
of 3Cy (,) that bounds (R» . Now (B,y is itself bounded by a certain number 
Z,y of (n — 2 ) -flats through the origin. Through each of these (n — 2) -flats 
we pass a hyperplane 3C A * 3 , k = 1,2, • • • , Uj , that contains X,- . These 
hyperplanes, along with (B,-,- , determine a new flat-sided cone (Rn having 
(Rij as one face and the line containing X» as a one-dimensional boundary. 
The interiors of the fc, flat-sided cones <Ra , (R« , • • ■ , (R.-fc, are disjoint. 
Together with their boundaries they exhaust (R t - . The line through X,- 
is common to the boundaries of all fc, of these flat-sided cones. The 
spherical pyramid Ra is the intersection of (ft,,- with S. 

We denote by C\ the spherical cap of S of angle <p about X,- (see Fig. 3 ) . 
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— CL 




Fig. 3 — Cap and triangulated decision region. 

Let Ti be the portion of C, exterior to Ri and let Si be the portion of Ri 
exterior to C t - . A typical term of the sum (7 ) can then be written 

/ P (| X, - X |)ifa = (/ + / - / ) g r (\ % - 1 \)ds. 

Ri \ Ci S{ Ti/ 

Now Ti can be broken up into pieces 7\-y corresponding to the spherical 
pyramids Rtj . To accomphsh this, we extend the sides of the pyramid 
beyond its base. Thus, if SUj is on the positive side of 3C/° and 5C k lti) , 
k = 1, 2, • • • , Uj , then Ta is the part of the spherical cap on the nega- 
tive side of 3C/° and the positive side of 3C* ( *' } , k = 1, 2, ■ • • , Ui • We 
now have 

/ g r ds = / g r ds + / g r ds — 22 / 9rds. (12) 

J Ri J Ci J 8 i j"=l J Tij 

Some of the regions Si , Ta can, of course, be void. 
We now sum (12) over the M regions R t . We write 

i i=i 

for the total number of {n — 2) -boundaries in the net on S. (Each 
boundary of Ri is shared with one other spherical polygon. ) There results 

MU(r) = S [ g r (\Xi -X\)ds = M f ^(1^ - t\)ds 

+ E f gr{\t<-t\)d8-t,it f gr(\%-X)ds. 

t=l J 8i i=l j=l •'r,-,- 

We next use (11) for the regions T,y . 

MU(r) ^ M f g r (\ ±i - t \)ds + E f g r ds -EE KiU,) 

J Ci t-=l J 8i i— 1 ;'=1 

where Uj is the content of Ti,- . The convexity (10) of h now gives 
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MU{r) ^M f g r (\ti-t\)ds 



+ 



Mr / -J M hi 

t=l J8i \&K t=l ;=l 



(14) 



Now denote by s, c, and Si , respectively, the content of S, d , and Si . 
A division of S analogous to (13) (set g = 1 there) gives 

M 



M hi 

= M c + 23 s< - Z 23 fcy 

1 »=1 ;=1 

= M c + s' — 23 23 fc/ 

t=l 3=1 



(15) 



where we write s' = 23 s « f° r tne sum °f the contents of all of the pieces 
of the polygons Ri that fall outside their respective spherical caps. We 
then have from (14) and (15) 

MU(r) =S M J g r ds + g f g r ds - 2kh r (M2J=JL±J?) 

= M f c g r ds - 2kh r (^zj~) (16) 

+ 23 [ Qrds -2k f r (|? -X\)ds 

where K is the region (see Fig. 4) of the spherical cap about P that lies 
between hyperplanes through the origin that cut from the cap regions of 
content (Mc — s + s')/2k and (Mc — s)/2k. This latter quantity will 
henceforth be assumed to be nonnegative. The normals to the two hyper- 
planes and the vector Y from the origin to P are chosen coplanar. 

Note now that the sum of the last two terms in (16) cannot be posi- 
tive, for we have 




Fig. 4 — The region K. 
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£ f g T (\% -t\)ds£ g r (d) £/ dr - ^(d)s' (17) 



and 



2/o 



f £ r (|Y - *|)<fe ^ g r (d)2k f ds = g r (d)s (18) 



where d = \/2 — 2 cosy? is the distance from the center of the cap to the 
edge. Here we have used the fact that g r is monotone decreasing. Equality 
holds in (17) and (18) only when s = 0. 

In view of the above, the inequality in (16) can be continued by 
omitting the last two terms there and we then obtain our desired in- 
equality 



MU(r) ^ M f c r (| Y - X \)ds - 2kh r (^ F J ) ( 



19) 



where C is the spherical cap of angle <p about the terminus of Y and we 
require Mc — s ^ 0. Retracing all the inequalities used to derive (19), 
we see that the equality sign holds there if and only if s' = 0, all the 
regions Tij have equal content and each Ttj is a region cut off from the 
spherical cap by a single hyperplane. 

In closing this section we note one further fact. From the convexity of 
h r {x) it follows that xh r (a/x) is monotone decreasing in x. For given M 
and c, then, the right side of (19) is monotone increasing in k. 

III. OPTIMALITT OF THE REGULAR SIMPLEX AND CERTAIN OTHER CODES 

Let a code of M unit vectors in £„ have maximum-likelihood regions (R,- 
that reticulate the surface of the unit sphere into a net having fc (n — 2)- 
dimensional boundaries. We designate such a code by the symbol 
{n,M,k}. For certain values of the parameters n, M and k, there may 
exist codes for which a spherical cap angle <p can be found such that the 
conditions for equality hold in (19). We call such a code a symmetric 
{n,M,k} . By choosing C so that (19) is an equality for such a symmetric 
code, we see that the probability Q of no error for a symmetric {n,M,k} 
is greater than the no-error probability of any nonsymmetric {n,M,k\. 
Indeed, the concluding remark of Section II shows that the no-error 
probability of a symmetric {n,M,k} is greater than the no-error proba- 
bility of every \n,M,k') if k' < k. 

The regular simplex code consists of M = n + 1 unit vectors 
Xi , X 2 , • • ■ , X„ + i in S n with X.-Xy = — (1/n), i 9^ j. The maximum- 
likelihood region <R t containing X»- is bounded by n hyperplanes. It is 
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readily verified that the regular simplex is a symmetric {n, n + 1, 
n (n + l)/2}. Now no code of n + 1 unit vectors in S„ can have more 
than k = n(n + l)/2 (n-2) -boundaries in its maximum-likehhood net, 
for, by the construction given in the first paragraph of Section II, each 
maximum-likehhood region can be bounded by at most n hyperplanes. 
The regular simplex code then must have a Q strictly greater than any 
other equal energy code of n + 1 vectors in 8„ except possibly another 
distinct symmetric {n, n + 1, n(n + l)/2}, should such exist. But this 
latter eventuality cannot happen. That a symmetric {n, n + 1, 
n (n + l)/2} must be the regular simplex can be seen as follows. Since 
no (R,- for a code of M = n + 1 points can have more than n hyperplane 
boundaries, then to have k = n(n -+- l)/2, every (R,- must have exactly 
n hyperplane boundaries. The n hyperplanes 3Ci (1) , 3C2 (1) , • • ■ , 3C„ (1> that 
bound (Ri bisect, respectively, the line segments from Xi to X 2 , from &i 
to & 3 , • • ■ , from Xi to % n +i ■ Since the code is assumed symmetric, these 
hyperplanes must be equidistant from Xi . Thus, all the other code points 
are equidistant from X x . But a similar argument holds for each of the 
other regions (Rg , fltg , • • • , (R n +i and so all distances between pairs of 
code points are equal. But this property suffices to define the regular 
simplex. 

The optimality of two other codes in n = 3 dimensions can readily be 
established by using (19 ) . We note first that in 3 dimensions the condi- 
tions for equality to hold in (19) are such that the maximum-likelihood 
net on the sphere must be composed of congruent regular spherical 
polygons. A symmetric {3,M,k} then must be the radial projection onto 
the unit sphere of a regular three-dimensional polyhedron. The code 
points are the centers of the faces of the polyhedron. 

Consider now the code formed by the midpoints of the faces of a cube 
of edge length 2. This is the three-dimensional biorthogonal code. The 
maximum-likehhood net is given by the radial projection of the cube 
edges onto the inscribed unit sphere. The code is a symmetric {3,6,12}. 
There is no regular polyhedron with 6 faces other than the cube, so that 
we will have shown the three-dimensional biorthogonal code to be opti- 
mal if we establish that every {3,6,/c} must have k ^ 12. To see this latter 
fact, note that for three-dimensional codes, at least three edges of the 
maximum-likelihood net must meet at each vertex of the net (since each 
Ri is convex). Thus Sv ^ 2k where v and k are, respectively, the total 
number of vertices and edges for the net. Euler's formula (Ref . 8, p. 9 ) 
V — k + M = 2 holds for the net, and so 

k£Z(M-2). (20) 
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For the case at hand M = 6, and (20) gives k ^ 12, so that the proof 
is completed. 

Analogous reasoning shows that the centers of the faces of the dodeca- 
hedron give the best code with M = 12 points. The code is a symmetric 
{3,12,30}. 

The regular octahedron in S 3 gives rise to a symmetric {3,8,12} whose 
code points are the vertices of a cube. This is not the optimal configura- 
tion of 8 points in 8 3 . By rotating one face of the cube 45 degrees about 
an axis perpendicular to the face and through its center, one obtains a 
{3,8,10} . By translating this face slightly toward the opposite face of the 
cube, and by slightly expanding both faces, one obtains a {3,8,16} with 
minimum distance between code points strictly larger than the minimum 
for the cubic arrangement of points. There are then noise functions 
/( | z | ) of (1) for which this new code has a larger Q than the cube-code. 

It is not known whether the symmetric {3,20,30} obtained from the 
regular icosahedron is an optimal code of 20 points. 

IV. THE BIORTHOGONAL CODE 

The biorthogonal code is a symmetric {2n,n,n(2n — 2)}. The 2n 
code points can be taken as the points on the coordinate axes unit dis- 
tance from the origin. Alternatively, the code points can be described 
as the centers of the (n — 1) -dimensional bounding cells of the unit 
n-cube. The radial projection of the cube onto the unit sphere with center 
at the center of the hypercube gives the maximum-likelihood net of the 
code. 

We have seen that for n = 3 the biorthogonal code is optimal among 
codes of M = 2n = 6 points. It is natural to suspect that for all n the 
biorthogonal code is optimal among codes of 2n points in S„ . However, 
the methods used in this paper, based as they are on (19), will not suffice 
to settle this question, for, as will be shown below, when n ^ 4, there 
exist {2n,n,n(2n — 1)} codes; i.e., codes with a larger k value than the 
biorthogonal code. 

It might be thought that this encumbering dependence of (19) on k 
could be avoided — that an inequality for Q independent of k could be 
found which is attainable for optimal codes. The example already treated 
of the octahedron shows, however, that this dependence on A; is essential. 

To construct a \2n,n,?i(2n — 1)} for n ^ 5, choose 2n distinct real 
numbers v\ , v-i , • • • , vi n . The vectors of the code are given by 

X, = (a, , a.iVi , aiPi~, • • • , atu" ) , (21 ) 

where 
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cti = [1 + vf + ,/+..- + vl n ~V i= 1,2, ■■■ ,2n 

has been chosen so that X,- is a unit vector. The code is closely related to 
the cyclic polytope described by Gale . 

An important property of this code can be derived 14 by considering the 
polynomials 

F«(X) - (X - *)"(A - vif = t At^X* 

P =o (22) 

hj = 1, 2, • • • , 2n 

which are nonnegative. We define the (2n) 2 n-vectors 

B,,- - <£</», A,-/ 11 , A«» i,-/ 3) , A«,« 0, ... , 0). 
We then have 

Bij-t, = ai Z A|y W n» = FaM 

p=0 



o, 




I = i 


= 0, 




I = J 


[aw 


> o, 


I 9* i, 



(23) 



I 



where the positivity of the am follows from the factored form (22) of 

To show that the points (21) determine a {2n,n,n(2n — 1)} we 
note first that they span S„ . Indeed every choice of n vectors X t from 
(21 ) yields an independent set, as can be seen by forming the determinant 
whose rows are the components of the vectors. These determinants are 
proportional to Vandermonde determinants and do not vanish. To show 
that k = n(2n — 1) for the code, consider the maximum likelihood 
region (R, containing X, . By the construction described in the first 
paragraph of Section II, (R t - is the intersection of the half -spaces 

«/* (X) = (X,- - tj) -X ^ ; = 1, 2, • • • , 2n; j * i. (24) 

We assert that each of the 2n - 1 hyperplanes 3c/*\ j = 1,2, ■ ■ ■ , 2n 
with j ^ i, is indeed an (n — 1) -dimensional boundary of (R,- . It will 
then follow that k = \2n (2n — 1 ) since there are 2n maximum likeli- 
hood regions. That ac/° is an (n - 1) -boundary of (R,- results from the 
fact that there exists a point X contained in (R, that lies in 3C/° but not 
in 3C* W , k = 1, 2, • • • , 2w with k ^ i and k ^ j. From (23) we can 
choose Xo = B, j since 

3C, (0 (B.-,-) = 

Sfc w (B, y ) = a ijk > 0, k^i, k* j. 
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X,t = 77^ ( cos k - , sin k - , cos k — , sin k - 1 k = 1, 2, 



For ?i = 4, the configuration of eight points given by 

1 / TV r 

7= I cos k - , sin k - , cos k - 
V 2 \ 4 ' 4 ' 2 

is a {4,8,28}. The proof is similar to that just given for the case n ^ 5 
with the role of the polynomial F,-,-(X) being replaced here by the expres- 
sion 

*V*(X) = [l - cos (\ - i |Y] X [l - cos (\ - j |)1 . 

We omit the details. 

We close this section by noting that although we cannot show that the 
biorthogonal code has a largest Q value for codes of In points, it does 
have largest nearest neighbor distance, 90° in angular terms. Indeed no 
collection of more than n + 1 vectors in £„ can have minimum angular 
distance between points greater than 90°. For consider* Fig. 5. Without 

+ - - - - 

o + - - - 

0+ — - 



Fig. 5 — Table of component signs. 

loss of generality the positive .Ti-axis of a rectangular coordinate system 
can be chosen to lie along the first vector. The first column of the figure 
shows the sign of the components of this vector. The coordinate axes can 
be oriented so that X 2 lies in the x% — x% plane and the direction of the 
.ro-axis can be chosen so that the .To-component of X 2 is positive. The 
second column of Fig. 5 shows the sign of the components of X 2 . The first 
component must be negative since if the minimum distance is to be 
greater than 90° we must have Xi-Xo < 0. Continuing in this manner we 
are forced to choose the components of the X x , X 2 , • • • , X„+i as shown. 
But now it is impossible to find an (n + 2)nd vector having a negative 
scalar product with these n + 1 vectors, for if the nonzero components 
of X n+2 are all negative, it has a positive scalar product with X„ + i , 
whereas if the first positive component of X„ +2 is the jth, XyX„ +2 is 
positive. 

* This elegant proof was suggested by J. H. van Lint. 
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V. THE INFINITE-BAND GAUSSIAN CHANNEL 

When M = n + 1 and/(x) is given by (4), the model discussed here 
describes the transmission of M equally likely signals s,(£), i— 1,2, • • • , 
n + 1, of duration T in white Gaussian noise of spectral power density 
N/2. Here the signals are constrained by 



f 



s?(t)dt = PT. 



When these signals are transmitted, the probability of no error using the 
best possible detection scheme is given by (2), where the X,- must be 
chosen so that 



t*-% = 4ff s*(t)*i(t)dt, 



PT < 

the (R,- are the maximum-likelihood regions (3), and 



2 

a = 



N 
2PT 



See Ref . 3, Sections 4.2 and 4.3 or Ref. 15 for a more detailed description 
of the correspondence between the geometric model and the physical one. 
Our result that the simplex code is optimal means that in communicat- 
ing in infinite-band white Gaussian noise by means of M equally likely 
equal-energy signals of duration T (no bandwidth restrictions imposed) 
the error probability is minimized by choosing signals with normalized 
cross-correlation 

i ^ Si (t) Sj (t)dt = - j^-j , i^j (25) 

this being the value of X,--Xj for the regular simplex. 

The error probability with a best set of signals of form (25 ) is readily 
determined to be 

P. = !-/_>/(*)*»-■(*+ I yCI=), (26) 

where /(x) is the Gaussian density (4) and$ the cumulative 

*(y) = f f(x)dx. 

J— 00 

When the transmission rate 
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log M 



R = 



T 



is kept constant, along with N and P, (26) becomes for large T (and 
hence large M) 

P c = exp [-E(R)T + o(T)], (27) 

where 



E(R) = 



[(Vc - Vr)\ R ^ C/4 



(28) 



and C = P/N is the capacity of the channel. That the minimal asymp- 
totic error probability for this channel must have the form (27)- (28) 
was first proved by Wyner. 15 



APPENDIX A 



A Lemma 



The following lemma will be useful in establishing the main results of 
Appendices B and C. 

Lemma: Let Wi(x) andw?[x) be integr able functions that satisfy 

rb rb 

I wi(x)dx = I w 2 (x)dx. (29) 

J a J a 

Further, suppose there exists an x , a ^ x S b, such that 

Wi(x) ^ Wi(x), a ^ x S x 

(30) 
M>2 (#) ^ wi{x), x 5s x £g 6. 

Then, if m{x) is a nonnegative monotone increasing function, 

rb fb 

I m(x)wi(x)dx ^ / m(x)w 2 (x)dx. (31) 

If m(x) is a nonnegative monotone decreasing function, 

I m{x)w\(x)dx ^ / m(x)w 2 (x)dx. (32) 

Equality holds in (31) and (32) only if w\(x) = w 2 (x) for almost all x. 
Proof: If m(x) is nonnegative and monotone increasing, then 
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/ m(x)[wi(x) — w 2 (x)]dx 

= I m(x)[wi(x) - w 2 (x)]dx + / m(x)[wi(x) - w 2 (x)]dx 

•>a J x' 

^ m(x') / [wi(x) - w 2 (x)]dx + m{x) I [wi(x) - w 2 (x)]dx 

J a J x' 

= m(x') I wi(x)dx — J w 2 (x)dx = 0. 

If m(x) is nonnegative and monotone decreasing, the steps are the same 
with the inequalities reversed. 

APPENDIX B 

Convexity of h r (w) 

Let xi,x 2 , ■ • • ,x n be the rectangular coordinates of a point in 8„ . 
The surface S of the unit sphere centered at the origin can be given 
parametrically by 

Xi = cos 0i 

x 2 = sin 0i cos 02 



Xj = sin 0i sin 2 • • • sin 0y_i cos 6j 

x n -i = sin 0i sin 02 ■ • • sin 0„_ 2 cos 0„_i 
x n = sin 0i sin 2 • ■ • sin 0„_ 2 sin 0„_i 



(33) 



£ 0< < t, i = 1, 2, ■ • • , n - 2 

^ 0„_i < 2tt 

and the element of surface content is 

ds = sin" -2 0i sin" -3 2 • • • sin n _ 2 d0i dd 2 • ■ ■ dd„-i . (34) 

We shall only be concerned with the case n ^ 3. 

The spherical cap of angle <p about P, the end point of 

Y= (1,0,0, ••• ,0), 

is given by 0i ^ <p. A hyperplane 3C that intersects this spherical cap is 
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£2 = Xi tan a with ^ a < ip and the intersection of 5C with the spherical 
cap is from (33) 

cos ft = tan a cot ft . 

We then have from the definition (8) 

h r (w) = I tfft / tfft sin" -2 ft sin" -3 ftg r (V2 - 2 cos ft) 

/ de z '• I ddn-t / d»»-i sin" -4 3 • • • sin 0„_ 2 

where »> = arccos (tan a cot ft) and /x = — i»if n = 3 but m = if n ^ 4. 
In either event, we can write 

h r (w) = kn / ddi \ dd 2 sin" -2 ft sin" -3 ft>0 r (-\/2 - 2 cos ft) (35) 

J a ''O 

while for the content of the piece of the cap cut off by 3C we have 

w = k n I ddi J dd 2 sin" -2 ft sin" -3 ft (30) 

with k n > and independent of a. 

Straightforward differentiation of (35) yields 



rv 

= — k„ sec 2 a I ddi sin ft cos ft 



rf/l 

= — ic„ sec a J aft sin t/i 

''a 

[1 - sec 2 a cos 2 ft] (n-4)/2 g r (\/2 - 2 cos ft). 



Now introduce 

.x = cos 2 ft , a = cos 2 <p, 6 = cos 2 a 

and 

<7(a:) = gr(V2 - 2V*). 

We have ^ a < 5 ^ 1. Note that ^(x) > is monotone increasing in 
x. In these terms 



dh _ k 
&x~'~ ~ 26 



r b r a:l (n " 4)/2 



and 
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Combining these results we find 

dh r (w) dh/da f ,, •.*, x , . n /c i -\ 

j = ,, ,, s = \ lb(x)g(x)dx > 0, (37) 

aw ( dw/da ) J a 



where 



Note that 



/ A (n_4)/2 

™~ ji~i*'a\™ > a = X = b - (38) 

7i- 2 V 1 b) 



j l b (x)dx = 1. (39) 



When n > 4, the convexity of h r (w) can be established from (37)- 
(39) as follows. Consider two different w values, say w 2 > W\ with cor- 
responding parameters b 2 = cos 2 a 2 and 6i = cos 2 «i . We have 

1 ^ b 2 > 6i > a. 

From (38) one readily finds that there is a unique real root x for which 
lbt(x) = hiix), a < x < bi . For a S x ^ x we have l bi (x) ^ l bl (x). 
If we now define l b (x) = for x > b, we can also write l bl (x) ^ W(x) 
for x ^ x . From (39) we have 



/6 2 rbz 

l bl (x)dx = J l b ^{x)dx. 



The conditions of the lemma of Appendix A hold and we conclude from 
(37) that 

dhriwz) w dh r (wi) 
dw dw 

which is the desired convexity. 

When n = 4, (38) becomes l b (x) = (6 - a) -1 for a g x ^ 6. As be- 
fore, we define l b (x) = for .-r > 6. It is readily seen that the lemma 
again applies with x chosen as 6i . Convexity is then established in this 
case as well. 

For n = 3, (37) and (38) give 
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(lh r (w) 

dw 



f g(x)dx 



g(x] 



- - 2Vb - *wrr; [ + 1. |/S*« 

on integrating by parts. However, since g is increasing in x, it follows 
that the last integral is increasing in b and hence also in w. The convexity 
proof is thus completed. 

appendix c 

Proof of Equation (11) 

We shall be concerned here with two different regions, V and W, cut 
off from the spherical cap of angle ip about the point P which we take as 
the terminus of the unit vector Y in 8„ (see Fig. 6). The region V is the 
intersection of the spherical cap with a convex cone V having the origin 
as a vertex. It is assumed that V does not contain P. We denote by Q a 
point of V closest to P. The second region, W, is cut off from the cap by 
a single hyperplane £ through the origin but not through P. £ is chosen 
so that W and V have the same content, w and v, respectively, and for 
purposes of our proof we restrict the normal to £ to lie in the 2-plane 
through the origin, P and Q. We wish to show that 



I v = f 0,(| Y - X \)ds £: I w m f g T {\ Y - X |)di 

Jy J w 



(40) 



with equality holding only if V is cut off from the cap by a single hyper- 
plane. Here, as in (11), g T is nonnegative and monotone decreasing and 
X is a generic unit vector in £„ . In the applications made of (40) in the 
main text, V is specialized to a type of flat-sided cone. 




CAP 



B v (*,) 

Fig. 6 — Regions involved in proof of (11). 
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Let us again adopt the spherical coordinates (33) with the pole 
P of the cap located on the zi-axis so that t = (1,0,0, • • • , 0). We sup- 
pose the axes are oriented so that Q lies in the xi-x 2 plane. Then from 
(33) and (34) 

Iv = f ddig r (V2 ~ 2 cos 5) v(6i) (41) 

"'0 

where 

v(6i) = sin" -2 0i J dd 2 ■ • • j d0«_i sin""' 2 ■■• sin 0„_i .^) 

By(6 l ) 

is the in — 2) -dimensional content of the intersection B v {0\) of V with 
the hyperplane x\ = cos 0i . Similarly, 

I w = I %(V2 - 2 cos 0! ) «>(0i) (43) 

•'o 

where w> (0i) is the (n - 2) -dimensional content of the intersection B w {6\) 
of W with the hyperplane Xi = cos 0i . By hypothesis we have 

v = I dBivM = w = dOiw(Oi). (44) 

Since g r (\/2 — 2 cos 0i) is a nonnegative monotone decreasing function 
of di , all the hypotheses of the lemma of Appendix A will hold if we can 
show the existence of a <p such that 

v(0i)^w(0!), 0£«i£4»' (45) 

w(6i) ^ v(0i), <p £ 0i£<p. 

The conclusion (32) of the lemma then is (40). 

Our goal now, therefore, is to show that v (0) and w (0) cross only once 
as indicated in (45). Let a = ZPOQ. If Q* is the nearest point in W to 
P and (8 = ^POQ*, then /S > a. For ^ ^ a, both v(6) andw(0) 
are zero. For a < ^ fi, v(0) > w(0) - 0. From (44) it then follows 
that there is a first point in (0,<p) where w(0) crosses up through v(0), 
that is, where v(0) = w(d) and w (0) > v (0) where the prime denotes 
differentiation with respect to 0. If there were a second crossing, at that 
point we would have w < v. We prove that there is only one crossing 
by demonstrating that 

vM = «,(*) => -— ;> — (46) 

for ^ 0i ^ <p. 
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Let co be such that v (co) = w (co), ^ co ^ p. Consider now the spheri- 
cal pyramid r F having Q as vertex and as base the set B v (oj) defined 
below (42). r F is the set of all points X of the form 

X = £X Q + r]X B , 

— > fl (47) 

£^0, 7,2:0, X Q =OQ, X B eB v (u>) 

where, in order for (47 ) to be a unit vector, we have the additional re- 
striction 

I X | 2 = 1 = ? + v 2 + 2tok Q t B • (48) 

Note that since V is convex and since Q and B r (co) are contained in V, 
it follows from (47) that r F is contained in V and 8, hence IV is con- 
tained also in V. 

Now let v(6i) denote the (n — 2) -dimensional content of the inter- 
section of IV with the hyperplane X\ = cos 0i , where a ^ 0i ^co. We 
have 

v(co) = v(co) 

(49) 
5(« - 5) g v(co - 5) 

where this last follows from the fact that T is contained in V. One has 
then 



v(u) — v(qj — 8) K g(co) — i;(co — 5) 



so that 



dy(ffi) 



d0i 



sd 4r ' (50) 



Consider next the spherical pyramid T W (Q) having Q as vertex and 
as base the set B w (u) denned below (43). We denote by iZ)(0i) the 
(n - 2) -dimensional content of the intersection of IV with the hyper- 
plane a-i = cos 0i . As before, let Q* be the nearest point in W to P. We 
denote by w)*(0i) the (n — 2) -dimensional content of the intersection 
of the spherical pyramid T W (Q*) with the hyperplane Xi = cos 0i . 
Since Q* is contained in T W (Q), T w(Q*) is also contained in T W (Q) and 
we have 



jr (co) = w (co) 
?F'(co — 5) ^ u*(co — 8). 
From this it follows that 



(51) 
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dw(0i) I di5*(ei) 1 

However, since Q* lies in the hyperplane, 



IV* (to) = Itf(co) 



(52) 



for all w, hence 



dw(di) 



d»x 



dw(6i) 



9 t =u 



ddi 



(53) 



In the remaining paragraphs of this appendix we shall show that 



v(u) = w{o)) 



dv(dy 



ddi 



< 



dwM 



ddi 



(54) 



which will establish (46) and complete our proof, for the hypothesis 
of (46) follows from that of (54) by (49), (51), and (52) and the con- 
clusion of (46) follows from the conclusion of (54) by (50) and (53). 

Let the spherical coordinates of a point X in IV be denoted by the 
angles (<pi , • • • , <p n -i) (see Fig. 7). We employ the angles (0i , • • • , 0„-i) 
to describe a point X B in B v (9i). The content v(jt) of the intersection 
J(n) of T v with the hyperplane x t = cos p, a £ p ^ a is given by 

v(fi) = sin" -2 n I d<p2 sin n-3 «p2 / d<p 3 sin" <p 3 • • • J dip„-i . .^ 




x 2 



Fig. 7 — The mapping from X to Xg. 
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The relationships (47) -(48), however, serve to define a one-one trans- 
formation between the coordinates (/*, <p 2 , • • • , <p n -i) of a point in J(n) 
and the coordinates (co, 8 2 , • ■ ■ , 6 n -i) of a point in B v (a>), so that V(n) 
can be expressed as an integration over B F (w) as well. Taking compo- 
nents of (47), we find successively 

cos n = £ cos a + 7j cos co 

sin /x cos <p 2 = £ sin a + rj sin co cos 52 

sin /x sin <p 2 cos <p 3 = i\ sin co sin 2 cos d 3 



sin /i sin <p 2 • • • sin p,-_i cos p,- = r\ sin co sin v>2 ■ ■ • sin ^;-i cos <pj (56 ) 

sin /x sin <p 2 • ■ • sin ^>„_ 2 sin ^j n _i 

= ?j sin co sin ^> 2 ■ ■ ■ sin <p„_ 2 sin y>n-i 

j = 3, 4, • • • , n - 1. 

Dividing the nth equation by the (n — l)st yields <p n -i = 0,,-i • Dividing 
the (n — l)st equation by the (n — 2)nd then yields <p«-2 = 0„_ 2 . 
Proceeding in this manner, one finds <pj = 6j , j = 3, 4, • • • , n — 1. 
The first two equations of (56) can be solved for £ and rj. By substituting 
these expressions into (48) which now reads 

£ 2 + rj 2 + 2£tj (cos a cos co + sin a sin co cos 2 ) = 1, (57) 

we obtain a single relationship connecting <p 2 and 2 which we suppose 
solved in the form 

*»-*i(ft»M). (58) 

Equation (55 ) now becomes in the new variables 

v(n) = sin" -2 n I dd 2 -r£ sin" -3 <p 2 I dd s sin" -2 3 • ■ ■ / rf0 n -i 

Bv(co) (59) 

= sin" -2 co J de 2 G(6 2 , fi)h(d 2 ) 

with 

0ft.») = r-!^T- 2 * 2 Tsin^T- 3 (60) 

|_sin coj a02 L sm "2_ 
and 
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h(0%) = sin" -3 2 f dd z sin" -4 0, • •• f d0„_i , (61) 

where if n = 3 this latter expression is to be interpreted as unity. It is 
convenient now to define h(6 2 ) to be zero if 2 is not the second angle 
coordinate of a point in B v (u). In this notation, then, we have for 
n ^ 4 



dv(n) 



dn 






= sin" 2 co / dd 



H$ t ) (62) 



J«=W 



5(co) = sin n_2 co [* d$Ji(0t). (63) 

If n = 3, the lower limits of integration here should be replaced by — ir. 
It will be shown later that dG/dn | M= *, is a nonnegative monotone de- 
creasing function of 02 • 

We next seek to determine the nature of the set B v (o>) of given con- 
tent v(u) that will maximize (62). We note first from (61) that for n ^ 4 

h(6 2 ) =g a(0 s ) (64) 

where a(0 2 ) is the surface content of a sphere of radius sin 2 in S„_2 
since o- is given by the integrals of (61) with the integration variables 
running through their maximum allowable range. Now let B (to) be the 
set of points defined by 0i = to, ^ 2 ^ 9 2 where 2 ' is given by 

r9 2 ' 

u(co) = sin n-2 o> / d0 2 cr(0 2 ). 
For B*(w) we have 



so that 



We also have 



y(co) = sin" -2 to f* dd 2 h*(d 2 ). (66) 



h*(d 2 ) ^ ft(0 2 ), ^ 02 ^ 02 

h*(e 2 ) ^ h(e 2 ), e 2 ^ 02 ^ 7T 



(67) 



from (64) and (65). 
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The hypotheses of the lemma of Appendix A are thus met from (63 ) , 
(66), (67), and the mono tonicity of dG/dfj,. We conclude that among 
sets of equal content, dv/dn | M=U is a maximum for the set B*(co). The 
set B w (u), however, coincides with B (co). Equation (54) then follows 
for n ^ 4. The modification necessary to treat the case n = 3 is trivial. 

There remains only the demonstration that dG/dy. | p _, is nonnegative 
monotone decreasing in 2 . Equation (57) and the first two equations 
of (56) are identical with the equations that would hold for the three 
3-vectors OQ, X and X B of Fig. 7 constrained to satisfy (47). The rela- 
tionship (58) between <p 2 and 02 can most easily be written down by con- 
sulting this figure. The condition that the three points be coplanar with 
the origin is 



X 


y 


z 


Xq 


Vq 


Z Q 


Xb 


Vb 


z B 



= 



(68) 



where 



x = sin /i cos w 
x Q = sin a 
x B = sin co cos 02 



y = sin n sin <p 2 
y Q = 
y B = sin co sin 2 



z = cos /x 
z Q = cos a (69) 

Z B = COS CO 



which serves to determine (58). Routine implicit differentiation of (68) 
and (69) and evaluation at n = ia, <?2 = 02 yields 



d<p2 
d0 2 

d d<p2 

dn d0 2 



= 1 



sin a sin 2 



p=w sin co(cos a sin to — sin a cos a> cos 0o) 
sin a[cos a sin co cos 02 — sin a cos co] 



p=w 



sin co[cos a sin co — sin a cos co cos 02] 2 ' 



(70) 
(71) 
(72) 



The denominators of (71) and (72) are positive since co > a implies 
tan co cot a > 1 ^ cos 2 which is the same as 

cos a sin co > sin a cos co cos 2 . 

The numerator of (72) is nonnegative for points X B of interest to us 
since we are concerned only with points in the portion of the cap cut off 
by the hyperplane that passes through Q and through the origin and 
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has its normal lying in the plane POQ; i.e., points for which x 2 ^ X\ tan a. 
For points in this region on the sphere and in the hyperplane x\ = cos o> 
this inequality is 

sin to cos 82 ^ cos co tan a 

or 

cos a sin w cos &•> — sin a cos w ^ 0. 

Now from (60) and (70) 

ao 

Using (71) and (72), it is readily seen that this expression is nonnegative 
and monotone decreasing in 2 . 



, _* COS 03 . d d<p2 

= (w - 2) - + — -TT- 

sin w dp av2 



sin 6 2 dp. 
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